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DETAILED ACTION 

Response to Arguments 

1 . Applicant's arguments filed 1 0/1 5/07 have been fully considered but they are not 

persuasive. 

Applicant argues that Weare et al., do not teach filtering each first frequency 
domain representation of blocks of said media program using a plurality of filters to 
develop a respective second frequency domain representation of each of said blocks of 
said media program, said second frequency domain representation of each of said 
blocks having a reduced number of frequency coefficients with respect to said first 
frequency domain representation (Amendment, pages 10-12). 

The examiner disagrees, Weare et al., teach identifying all peaks in the FFT 
frame. The scalable critical band masking phase filters and removes any peaks that 
masked by surrounding peaks with more energy (col. 18, lines 49 -52, and 59 -61). 
Removing any peaks that are masked by surrounding peaks with more energy implies 
filtering each first frequency domain representation of blocks of said media program 
using a plurality of filters to develop a respective second frequency domain 
representation of each of said blocks of said media program, said second frequency 
domain representation of each of said blocks having a reduced number of frequency 
coefficients with respect to said first frequency domain representation, since all the 
removed peaks are first identified in the FFT frame. 
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Applicant argues tliat Laroclie does not teach reducing number of frequency 
coefficients (Amendment, pages 15, and 16). 

The examiner disagrees, Laroche teaches calculating a short-term Fourier 
transform on the incoming signal. The magnitudes of the FFT bins are summed within 
predefined frequency bands, and the results are processed by a number of frequency 
coefficients (col.2, lines 54 - 59). Filtering the short-term Fourier transform of the 
incoming signal implies reducing number of frequency coefficients, since only frequency 
components in the subband are obtained after the number of frequency coefficients. 



Claim Rejections - 35 USC § 102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1 ) an application for patent, published under section 1 22(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

3. Claims 1 -11, 14-16, 18 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Weare et a!., (US Patent 7,065,416). 

Regarding claim 1,21, 22, 23 Weare et al. discloses a method for use in 
recognizing the content of a media program (see col. 6, lines 22-27), said method 
comprising the steps of: filtering each first frequency domain representation of blocks of 



said media program using a plurality of filters to develop a respective second frequency 
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domain representation of eacli of said blocl^s of said media said second frequency 
domain representation of each of said blocl<s liaving a reduced number of frequency 
coefficients with respect to said first frequency domain representation program (see col. 
16, lines 47, fig. 7, element 750, describing a critical band filtering step which can be 
modeled as a filter bank, thus indicating that a plurality of filters exist); 

grouping frequency coefficients of said second frequency domain representation 
of said blocks to form segments (see fig. 8A element 804, col. 17, lines 57-60, and col. 
16, lines 25-30, where critical band filtering forms several critical bands, interpreted by 
the examiner as groups); and selecting a plurality of said segments (see col. 18, lines 
10-15, where the peaks with the highest energies are selected); 

comparing selected segments to features of stored programs to identify thereby 
said media program ("classification of media entities"; Abstract, lines 3 -7). 

Regarding claim 2, Weare et al. discloses the invention as defined in claim 1 
wherein each grouping of frequency coefficients of said second frequency domain to 
form a segment represents blocks that are consecutive in time in said media program 
(see. Col. 18, lines 10-15, since the peaks with highest energies are selected it follows 
that the segments may be contiguous in time if two highest peaks are positioned 
consecutively). 

Regarding claim 3, Weare et al. discloses t2he invention as defined in claim 1 
wherein said plurality of filters are arranged in a group that processes a block at a time. 
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the portion of Said second frequency domain representation produced by said group for 
each block forms a frame, and wherein at least two frames are grouped to form a 
segment (see col. 18, where peaks last for multiple frames, thereby having a segment 
at least two frames). 

Regarding claim 4, Weare et al. discloses the invention as defined in claim 1 
wherein said selected segments correspond to portions of said media program that are 
not contiguous in time (see col. 18, lines 10-15, since the peaks with the highest 
energies are selected, it follows that the segments may not be contiguous if a peak that 
does not meet the criteria "highest" is positioned between two "highest" peaks). 

Regarding claim 7, Weare et al. discloses the invention as defined in claim 1 
wherein the segments selected in said selecting step are those that have largest 
minimum segment energy (see col. 18, lines 10-15). 

Regarding claim 8, Weare et al. discloses the invention as defined in claim 1 
wherein the segments selected in said selecting step are selected in accordance with 
prescribed constraints such that said segments are prevented from being too close to 
each other (see col. 18, line 66 - col. 19 line 2, where only selecting peaks that last for 
more than specified number of frames prevents the peaks from being too close). 
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Regarding claim 9, Weare et al. discloses the invention as defined in claim 1 
wherein the segments selected in said selecting step are selected for portions of said 
media program that correspond in time to prescribed search windows that are 
separated by gaps (see col. 19, lines 5-10 where frames correspond to search windows, 
and the frames are individual thus, there is a separation by gaps). 

Regarding claim 10, Weare et al. discloses the invention as defined in claim 1 
wherein the segments selected in said selecting step are those that result in the 
selected segments having a maximum entropy over the selected segments (see col. 18, 
lines 12-15, where the most energetic peaks are chosen, thus choosing the most 
entropic peaks). 

Regarding claims 11-13, Weare et al discloses the Invention as defined In claim 
1 further comprising the step of normalizing said frequency coefficients In said second 
frequency domain representation after performing said grouping step, said 
normalization being performed on a per-segment basis; wherein said normalization 
Includes performing at least a preceding-time normalization; an L2 normalization 
("normalizing the sum"; see col. 16, lines 3-6). 

Regarding claim 14, Weare et al. discloses the invention as defined in claim 1 
further comprising the step of storing said selected segments in a database in 
association with an identifier of said media program (see col. 7, lines 59-65, where 
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music is stored in a database and for generating play lists thus an identifier must be 
associated with the stored data). 

Regarding claim 15, Weare et al. discloses the invention as defined in claim 14 
further comprising the step of storing in said database information indicating timing of 
said selected segments (see col. 9, lines 16-21, where classifying the tempo in the 
database indicates timing of media segment). 

Regarding claim 16, Weare et al. discloses the invention as defined in claim 1 
wherein said first frequency domain representation of blocks of said media program is 
developed by the steps of: digitizing an audio representation of said media program to 
be stored in said database (see col. 16, lines 41-44); dividing the digitized audio 
representation into blocks of a prescribed number of samples (see col. 16, lines 41-44, 
where the audio representation is divided into frames); smoothing said blocks using a 
filter (see col. 16, lines 45-47); and 

converting said smoothed blocks into the frequency domain, wherein said 
smoothed blocks are represented by frequency coefficients (see col. 16, lines 39- 41). 

Regarding claim 18, Weare et al. discloses the invention as defined in claim 16 
wherein each of said smoothed blocks are converted into the frequency domain in said 
converting step using a Fast Fourier Transform (FFT) (see col. 16, lines 39-41 and col. 
23, lines 52-54). 
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3. Claims 24-29, 31 , 32, 34 - 37 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Laroclie (US Patent 6,453,252). 

Regarding claims 24, 34, 35, Laroche discloses a method for use in recognizing 
the content of a media program, said method comprising the steps of filtering each first 
frequency domain representation of blocks of said media program using a plurality of 
filters to develop a respective second frequency domain representation of each of said 
blocks of said media said second frequency domain representation of each of said 
blocks having a reduced number of frequency coefficients with respect to said first 
frequency domain representation program( see fig. 1 and col. 2, lines 36-48); 

grouping frequency coefficients of said second frequency domain representation 
of said blocks to form segments(see col. 2, lines 46-48); and searching a database for 
substantially matching segments, said database having stored therein segments of 
media programs and respective corresponding program identifiers (see col. 4, lines 33- 
34). 

Regarding claim 25, Laroche discloses the invention as defined in claim 24 
further comprising the step of indicating that said media program cannot be identified 
when substantially matching segments are not found in said database in said searching 
step (see col. 4, lines 38-42, where the value indicates if there is a true match or not). 
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Regarding claim 26, Laroche discloses the invention as defined in claim 24 
wherein said data base includes information indicating timing of segments of each 
respective media program identified therein (see col. 4, line 64- col. 5, line 5), and 
wherein a match may be found in said searching step only when the timing of said 
segments produced in said grouping step substantially matches the timing of said 
segments stored in said database (see col. 5, lines 5-10, where fingerprints tal<en at 
other maxima will not fit, thus the match will only be found when the timing segments 
match). 

Regarding claim 27, Laroche discloses the invention as defined in claim 24 
wherein said matching between segments is based on the Euclidean distances between 
segments (see col. 4, lines 34-38). 

Regarding claim 28, Laroche discloses the invention as defined in claim 24 
further comprising the step of identifying said media program as being the media 
program indicated by the identifier stored in said database having a best matching score 
when substantially matching segments are found in said database in said searching 
step (see col. 4, lines 38-42, where the match is determined by the smallest value, 
where larger values may match substantially, but are not indicated as the best match). 



Regarding claim 29, Laroche discloses the invention as defined in claim 28 
further comprising the step of determining a speed differential between said media 
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program and a media program identified in said identifying step (see col. 3, lines 64-67, 
where two signals can differ by a slowly time-varying function). 

Regarding claim 31 Laroche discloses the invention as defined in claim 28 further 
comprising the steps of: repeating said filtering, grouping, searching and identifying; and 
determining, in the event of another match, whether said identified program is the same 
program determined prior to said repetition or a different program (see col. 5, lines 29- 
32, where the program is implemented in software stored on a computer readable 
medium, allowing the program to be repeated whenever necessary). 

Regarding claim 32, Laroche discloses the invention as defined in claim 31 
wherein said determining step is based on an overlap score (see claim 6, where an 
identifying method is claimed based on a segment divided into overlapping frames). 

Regarding claim 36, Laroche discloses the invention as defined in claim 35 
wherein said first frequency domain representation of said media program comprises a 
plurality of blocl<s of coefficients corresponding to respective time domain sections of 
said media program (see col. 2, lines 36-40) and said second frequency domain 
representation of said media program comprises a plurality of blocks of coefficients 
corresponding to respective time domain sections of said media program (see col. 2, 
lines 42-48). 
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Regarding claim 37, Laroclie discloses a computer readable storage arranged to 
store segments derived from, and representative of, various media programs, said 
segments of each respective one of said media programs being stored in said database 
so as to be associated with a respective media program identifier (see col. 4, lines 33- 
38, where the database is composed of known material, which must have an identifier); 

segments is developed by filtering a first frequency domain representation of said 
media program using a plurality of filters to develop a second frequency domain 
representation of said media program having a reduced number of frequency 
coefficients in said second frequency domain representation with a unique to said first 
frequency domain representation ( see fig. 1 and col. 2, lines 36-48); 

grouping ones of said second frequency domain representation to form said 
segments (see col. 2, lines 46-48). 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the phor art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. Claims 5, 6, 17, and 19 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Weare et al., (US Patent 7,065,416). 

Regarding claim 5, Weare et al. discloses the limitations of claim 1 as discussed 
above. Weare et al. does not disclose wherein said plurality of filters includes at least a 
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set of triangular filters. However using triangular filters as a smoothing filter is well 
known in the art, because the greater the parameter, the greater the degree of 
smoothing, providing good suppression of higher frequencies. Thus it would have been 
obvious to one of ordinary skill in the art to use triangular filters for smoothing. 

Regarding claim 6, Weare et al. discloses the limitations of claim 1 as discussed 
above. Weare et al. does not disclose wherein said plurality of filters includes at least a 
set of log-spaced filters. However using log-spaced triangular filters is well known in the 
art. The log function provides compression of the dynamic range of filterbank output 
energies. Thus it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to use a set of log-spaced filters. 

Regarding claim 17, Weare et al. discloses the invention as defined in claim 16 
as discussed above. Weare et al. does not disclose wherein said filter used in said 
smoothing step is a Hamming window filter. However this feature is well known in the 
art as indicated by the applicant's disclosed specification. Applicant discloses that 
those of ordinary skill in the art will recognize that Hamming window filter or Hanning 
window may be employed for smoothing. Thus it would have been obvious to one of 
ordinary skill in the art at the time the invention was made to employ Hamming window 
filter. 
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Regarding claim 19, Weare et al. discloses the limitations of the invention as 
defined in claim 16 as discussed above. Weare et al. does not disclose wherein each of 
said smoothed blocks are converted into the frequency domain in said converting step 
using a Discrete Cosine Transform (DCT). However this feature is well known in the art 
as indicated by the applicant's disclosed specification. Applicant discloses that those of 
ordinary skill in the art will recognize that discrete cosine transform can be used in the 
place of fast Fourier transform to covert the time domain to frequency domain (see page 
1 1 , lines 1 5-1 7). Thus it would have been obvious to one of ordinary skill in that art at 
the time the invention was made to use discrete cosine transform to convert blocks from 
the time domain to frequency domain. 

6. Claims 30, and 33 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Laroche (US Patent 6,453,252). 

Regarding claim 30, Laroche discloses the limitations of the invention as defined 
in claim 28 as discussed above. Laroche does not disclose wherein said matching score 
for a program P.sub.i is determined by Pi=lzj=lzf(Sj'-Sj(Pi)). However this feature is well 
known in the art. This equation is a measure of the average distance. It produces the 
same results as calculating the Euclidean distance. Thus it would have been obvious to 
one of ordinary skill in the art to use this equation to produce a matching score. 

Regarding claim 33, Laroche discloses the invention as defined in claim 32 as 
discussed above. Laroche does not disclose wherein overlap score is calculated 
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between said program determined prior to said repetition, PO, and said program 
determined during said repetition, PI, is calculated as Overlap score=(t.sub.end- 
t.sub.begin)/(end time of Pl-beginning time of PI) where t.sub.end is min(end time of 
PO, PI); and t.sub.begin is max(beginning time of PO, PI). However this feature is well 
l<nown in the art. The overlap score is an indication of how much time is shared by two 
items. This is an obvious calculation of overlap, thus it would have been obvious to one 
of ordinary skill in the art at the time the invention was made to use this equation to 
calculate overlap. 



Conclusion 

6. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 
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Any inquiry concerning tliis communication or earlier communications from the 
examiner should be directed to LEONARD SAINT CYR whose telephone number is 
(571) 272-4247. The examiner can normally be reached on Mon- Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is (571)- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

LS 

03/16/08 



/Richemond Dorvil/ 

Supervisory Patent Examiner, Art Unit 2626 



